Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 54
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Genome Biol Evol ; 15(10)2023 10 06.
Artigo em Inglês | MEDLINE | ID: mdl-37776514

RESUMO

Gene duplication is recognized as a critical process in genome evolution; however, many questions about this process remain unanswered. Although gene duplicability has been observed to differ by duplication mechanism and evolutionary rate, there is so far no broad characterization of its determinants. Many features correlate with this difference in duplicability; however, our ability to exploit these observations to advance our understanding of the role of duplication in evolution is hampered by limitations within existing work. In particular, the existence of methodological differences across studies impedes meaningful comparison. Here, we use consistent definitions of duplicability in the human lineage to explore these associations, allow resolution of the impact of confounding factors, and define the overall relevance of individual features. Using a classifier approach and controlling for the confounding effect of duplicate longevity, we find a subset of gene features important in differentiating genes duplicable by small-scale duplication from those duplicable by whole-genome duplication, revealing critical roles for gene dosage and expression costs in duplicability. We further delve into patterns of functional enrichment and find a lack of constraint on duplicate retention in any context for genes duplicable by small-scale duplication.


Assuntos
Evolução Molecular , Genoma , Humanos , Genômica , Dosagem de Genes , Duplicação Gênica
3.
Nat Commun ; 14(1): 2879, 2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-37208359

RESUMO

Whole genome duplication (WGD) is a dramatic evolutionary event generating many new genes and which may play a role in survival through mass extinctions. Paddlefish and sturgeon are sister lineages that both show genomic evidence for ancient WGD. Until now this has been interpreted as two independent WGD events due to a preponderance of duplicate genes with independent histories. Here we show that although there is indeed a plurality of apparently independent gene duplications, these derive from a shared genome duplication event occurring well over 200 million years ago, likely close to the Permian-Triassic mass extinction period. This was followed by a prolonged process of reversion to stable diploid inheritance (rediploidization), that may have promoted survival during the Triassic-Jurassic mass extinction. We show that the sharing of this WGD is masked by the fact that paddlefish and sturgeon lineage divergence occurred before rediploidization had proceeded even half-way. Thus, for most genes the resolution to diploidy was lineage-specific. Because genes are only truly duplicated once diploid inheritance is established, the paddlefish and sturgeon genomes are thus a mosaic of shared and non-shared gene duplications resulting from a shared genome duplication event.


Assuntos
Duplicação Gênica , Máscaras , Animais , Genoma/genética , Peixes/genética , Evolução Biológica , Evolução Molecular , Filogenia
5.
Cell Rep ; 41(12): 111808, 2022 12 20.
Artigo em Inglês | MEDLINE | ID: mdl-36543139

RESUMO

Small open reading frames (sORFs) can encode functional "microproteins" that perform crucial biological tasks. However, their size makes them less amenable to genomic analysis, and their origins and conservation are poorly understood. Given their short length, it is plausible that some of these functional microproteins have recently originated entirely de novo from noncoding sequences. Here we sought to identify such cases in the human lineage by reconstructing the evolutionary origins of human microproteins previously found to have measurable, statistically significant fitness effects. By tracing the formation of each ORF and its transcriptional activation, we show that novel microproteins with significant phenotypic effects have emerged de novo throughout animal evolution, including two after the human-chimpanzee split. Notably, traditional methods for assessing coding potential would miss most of these cases. This evidence demonstrates that the functional potential intrinsic to sORFs can be relatively rapidly and frequently realized through de novo gene emergence.


Assuntos
Evolução Molecular , Hominidae , Animais , Humanos , Hominidae/genética , Genoma , Fases de Leitura Aberta/genética , Pan troglodytes
8.
PLoS Biol ; 20(7): e3001691, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35852990

RESUMO

Mendel, a genius experimentalist, meticulously uncovered the genetic basis of heredity in work that transformed the science of biology. But does the alluring simplicity of Mendel's laws sometimes obscure the true complexity of genetics?


Assuntos
Cognição , Genética , História do Século XIX
9.
Genome Biol Evol ; 14(1)2022 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-35018456

RESUMO

The faster rate of evolution of duplicated genes relative to singletons has been well documented in multiple lineages. This observation has generally been attributed to a presumed release from constraint following creation of a redundant, duplicate copy. However, it is not obvious that the relationship operates in this direction. An alternative possibility-that the faster rate of evolution predates the duplication event and the observed differences result from a higher propensity to duplicate in fast-evolving genes-has been tested in primates and in insects. However, these studies arrived at different conclusions and clarity is needed on whether these contrasting results relate to differences in methodology or legitimate biological differences between the lineages selected. Here, we test whether duplicable genes are faster evolving independent of duplication in the Drosophila lineage and find that our results support the conclusion that faster evolving genes are more likely to duplicate, in agreement with previous work in primates. Our findings indicate that this characteristic of gene duplication is not restricted to a single lineage and has broad implications for the interpretation of the impact of gene duplication. We identify a subset of "singletons" which defy the general trends and appear to be faster evolving. Further investigation implicates homology detection failure and suggests that these may be duplicable genes with unidentifiable paralogs.


Assuntos
Drosophila , Evolução Molecular , Animais , Drosophila/genética , Duplicação Gênica , Genes Duplicados , Primatas/genética
11.
Nat Commun ; 12(1): 4489, 2021 07 23.
Artigo em Inglês | MEDLINE | ID: mdl-34301952

RESUMO

Ancient polyploidization events have had a lasting impact on vertebrate genome structure, organization and function. Some key questions regarding the number of ancient polyploidization events and their timing in relation to the cyclostome-gnathostome divergence have remained contentious. Here we generate de novo long-read-based chromosome-scale genome assemblies for the Japanese lamprey and elephant shark. Using these and other representative genomes and developing algorithms for the probabilistic macrosynteny model, we reconstruct high-resolution proto-vertebrate, proto-cyclostome and proto-gnathostome genomes. Our reconstructions resolve key questions regarding the early evolutionary history of vertebrates. First, cyclostomes diverged from the lineage leading to gnathostomes after a shared tetraploidization (1R) but before a gnathostome-specific tetraploidization (2R). Second, the cyclostome lineage experienced an additional hexaploidization. Third, 2R in the gnathostome lineage was an allotetraploidization event, and biased gene loss from one of the subgenomes shaped the gnathostome genome by giving rise to remarkably conserved microchromosomes. Thus, our reconstructions reveal the major evolutionary events and offer new insights into the origin and evolution of vertebrate genomes.


Assuntos
Cromossomos/genética , Evolução Molecular , Genoma/genética , Modelos Genéticos , Vertebrados/genética , Animais , Variação Genética , Humanos , Lampreias/genética , Filogenia , Poliploidia , Análise de Sequência de DNA , Tubarões/genética , Sintenia , Vertebrados/classificação
12.
Nat Commun ; 12(1): 1783, 2021 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-33741994

RESUMO

Resolving the relationships between the major lineages in the animal tree of life is necessary to understand the origin and evolution of key animal traits. Sponges, characterized by their simple body plan, were traditionally considered the sister group of all other animal lineages, implying a gradual increase in animal complexity from unicellularity to complex multicellularity. However, the availability of genomic data has sparked tremendous controversy as some phylogenomic studies support comb jellies taking this position, requiring secondary loss or independent origins of complex traits. Here we show that incorporating site-heterogeneous mixture models and recoding into partitioned phylogenomics alleviates systematic errors that hamper commonly-applied phylogenetic models. Testing on real datasets, we show a great improvement in model-fit that attenuates branching artefacts induced by systematic error. We reanalyse key datasets and show that partitioned phylogenomics does not support comb jellies as sister to other animals at either the supermatrix or partition-specific level.


Assuntos
Ctenóforos/genética , Genoma/genética , Genômica/métodos , Filogenia , Poríferos/genética , Animais , Evolução Biológica , Ctenóforos/classificação , Modelos Genéticos , Poríferos/classificação , Especificidade da Espécie
14.
Nat Commun ; 11(1): 781, 2020 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-32034123

RESUMO

Recent evidence demonstrates that novel protein-coding genes can arise de novo from non-genic loci. This evolutionary innovation is thought to be facilitated by the pervasive translation of non-genic transcripts, which exposes a reservoir of variable polypeptides to natural selection. Here, we systematically characterize how these de novo emerging coding sequences impact fitness in budding yeast. Disruption of emerging sequences is generally inconsequential for fitness in the laboratory and in natural populations. Overexpression of emerging sequences, however, is enriched in adaptive fitness effects compared to overexpression of established genes. We find that adaptive emerging sequences tend to encode putative transmembrane domains, and that thymine-rich intergenic regions harbor a widespread potential to produce transmembrane domains. These findings, together with in-depth examination of the de novo emerging YBR196C-A locus, suggest a novel evolutionary model whereby adaptive transmembrane polypeptides emerge de novo from thymine-rich non-genic regions and subsequently accumulate changes molded by natural selection.


Assuntos
Evolução Molecular , Proteínas de Membrana/genética , Proteínas de Saccharomyces cerevisiae/genética , Fatores Associados à Proteína de Ligação a TATA/genética , Timina , Fator de Transcrição TFIID/genética , Adaptação Biológica/genética , Retículo Endoplasmático/genética , Retículo Endoplasmático/metabolismo , Regulação Fúngica da Expressão Gênica , Aptidão Genética , Membranas Intracelulares/metabolismo , Proteínas de Membrana/química , Fases de Leitura Aberta , Domínios Proteicos/genética , Saccharomyces cerevisiae/genética
15.
Elife ; 92020 02 18.
Artigo em Inglês | MEDLINE | ID: mdl-32066524

RESUMO

The origin of 'orphan' genes, species-specific sequences that lack detectable homologues, has remained mysterious since the dawn of the genomic era. There are two dominant explanations for orphan genes: complete sequence divergence from ancestral genes, such that homologues are not readily detectable; and de novo emergence from ancestral non-genic sequences, such that homologues genuinely do not exist. The relative contribution of the two processes remains unknown. Here, we harness the special circumstance of conserved synteny to estimate the contribution of complete divergence to the pool of orphan genes. By separately comparing yeast, fly and human genes to related taxa using conservative criteria, we find that complete divergence accounts, on average, for at most a third of eukaryotic orphan and taxonomically restricted genes. We observe that complete divergence occurs at a stable rate within a phylum but at different rates between phyla, and is frequently associated with gene shortening akin to pseudogenization.


Assuntos
Evolução Molecular , Genes/genética , Sintenia/genética , Animais , Sequência Conservada/genética , Drosophila melanogaster , Humanos , Filogenia , Saccharomyces cerevisiae , Homologia de Sequência
16.
Yeast ; 36(7): 425-437, 2019 07.
Artigo em Inglês | MEDLINE | ID: mdl-30963617

RESUMO

The sequencing of over a thousand Saccharomyces cerevisiae genomes revealed a complex pangenome. Over one third of the discovered genes are not present in the S. cerevisiae core genome but instead are often restricted to a subset of yeast isolates and thus may be important for adaptation to specific environmental niches. We refer to these genes as "pan-genes," being part of the pangenome but not the core genome. Here, we describe the evolutionary journey and characterisation of a novel pan-gene, originally named hypothetical (HYPO) open-reading frame. Phylogenetic analysis reveals that HYPO has been predominantly retained in S. cerevisiae strains associated with brewing but has been repeatedly lost in most other fungal species during evolution. There is also evidence that HYPO was horizontally transferred at least once, from S. cerevisiae to Saccharomyces paradoxus. The phylogenetic analysis of HYPO exemplifies the complexity and intricacy of evolutionary trajectories of genes within the S. cerevisiae pangenome. To examine possible functions for Hypo, we overexpressed a HYPO-GFP fusion protein in both S. cerevisiae and Saccharomyces pastorianus. The protein localised to the plasma membrane where it accumulated initially in distinct foci. Time-lapse fluorescent imaging revealed that when cells are grown in wort, Hypo-gfp fluorescence spreads throughout the membrane during cell growth. The overexpression of Hypo-gfp in S. cerevisiae or S. pastorianus strains did not significantly alter cell growth in medium-containing glucose, maltose, maltotriose, or wort at different concentrations.


Assuntos
Cerveja/microbiologia , Proteínas Fúngicas/genética , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/isolamento & purificação , Membrana Celular/metabolismo , Cromossomos Fúngicos/genética , Evolução Molecular , Proteínas Fúngicas/metabolismo , Deleção de Genes , Expressão Gênica , Transferência Genética Horizontal , Genoma Fúngico/genética , Fases de Leitura Aberta , Saccharomyces/classificação , Saccharomyces/genética , Saccharomyces/crescimento & desenvolvimento , Saccharomyces/isolamento & purificação , Saccharomyces cerevisiae/classificação , Saccharomyces cerevisiae/crescimento & desenvolvimento
18.
Methods Mol Biol ; 1851: 63-81, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30298392

RESUMO

De novo genes, that is, protein-coding genes originating from previously noncoding sequence, have gone from being considered impossibly unlikely to being recognized as an important source of genetic novelty in eukaryotic genomes. It is clear that de novo gene evolution is a rare but consistent feature of eukaryotic genomes, being detected in every genome studied. However, different studies often use different computational methods, and the numbers and identities of the detected genes vary greatly. Here we present a coherent protocol for the computational identification of de novo genes by comparative genomics. The method described uses homology searches, identification of syntenic regions, and ancestral sequence reconstruction to produce high-confidence candidates with robust evidence of de novo emergence. It is designed to be easily applicable given the basic knowledge of bioinformatic tools and scalable so that it can be applied on large and small datasets.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Sequência de Aminoácidos , Evolução Molecular , Filogenia , Proteínas/classificação , Proteínas/genética , Sintenia
19.
Mol Biol Evol ; 35(12): 2886-2899, 2018 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-30252115

RESUMO

The competitive endogenous RNA (ceRNA) hypothesis is an attractively simple model to explain the biological role of many putatively functionless noncoding RNAs. Under this model, there exist transcripts in the cell whose role is to titrate out microRNAs such that the expression level of another target sequence is altered. That it is logistically possible for expression of one microRNA recognition element (MRE)-containing transcript to affect another is seen in the multiple examples of pathogenic effects of inappropriate expression of MRE-containing RNAs. However, the role, if any, of ceRNAs in normal biological processes and at physiological levels is disputed. By comparison of parent genes and pseudogenes we show, both for a specific example and genome-wide, that the pseudo-3' untranslated regions (3'UTRs) of expressed pseudogenes are frequently retained and are under selective constraint in mammalian genomes. We found that the pseudo-3'UTR of BRAFP1, a previously described oncogenic ceRNA, has reduced substitutions relative to its pseudo-coding sequence, and we show sequence constraint on MREs shared between the parent gene, BRAF, and the pseudogene. Investigation of RNA-seq data reveals expression of BRAFP1 in normal somatic tissues in human and in other primates, consistent with biological ceRNA functionality of this pseudogene in nonpathogenic cellular contexts. Furthermore, we find that on a genome-wide scale pseudo-3'UTRs of mammalian pseudogenes (n = 1,629) are under stronger selective constraint than their pseudo-coding sequence counterparts, and are more often retained and expressed. Our results suggest that many human pseudogenes, often considered nonfunctional, may have an evolutionarily constrained role, consistent with the ceRNA hypothesis.


Assuntos
Catarrinos/genética , Evolução Molecular , Regulação da Expressão Gênica , Proteínas Proto-Oncogênicas B-raf/genética , Pseudogenes , Regiões 3' não Traduzidas , Animais , Sequência de Bases , Humanos
20.
Mol Cell ; 70(3): 408-421.e8, 2018 05 03.
Artigo em Inglês | MEDLINE | ID: mdl-29628311

RESUMO

The polycomb repressive complex 2 (PRC2) consists of core subunits SUZ12, EED, RBBP4/7, and EZH1/2 and is responsible for mono-, di-, and tri-methylation of lysine 27 on histone H3. Whereas two distinct forms exist, PRC2.1 (containing one polycomb-like protein) and PRC2.2 (containing AEBP2 and JARID2), little is known about their differential functions. Here, we report the discovery of a family of vertebrate-specific PRC2.1 proteins, "PRC2 associated LCOR isoform 1" (PALI1) and PALI2, encoded by the LCOR and LCORL gene loci, respectively. PALI1 promotes PRC2 methyltransferase activity in vitro and in vivo and is essential for mouse development. Pali1 and Aebp2 define mutually exclusive, antagonistic PRC2 subtypes that exhibit divergent H3K27-tri-methylation activities. The balance of these PRC2.1/PRC2.2 activities is required for the appropriate regulation of polycomb target genes during differentiation. PALI1/2 potentially link polycombs with transcriptional co-repressors in the regulation of cellular identity during development and in cancer.


Assuntos
Complexo Repressor Polycomb 2/genética , Proteínas Repressoras/genética , Vertebrados/genética , Sequência de Aminoácidos , Animais , Diferenciação Celular/genética , Linhagem Celular , Células HEK293 , Histonas/genética , Humanos , Metilação , Metiltransferases/genética , Camundongos , Neoplasias/genética , Alinhamento de Sequência
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...